Network traffic threat identification

ABSTRACT

A computer implemented method to identify a computer security threat based on communication via a computer network including receiving a definition of acceptable network communication characteristics for each of a plurality of communication protocols; receiving a set of security events for the communication, each security event including network communication characteristics for the communication; for each security event in the set of security events: a) identifying a communication protocol associated with the event; b) detecting deviations of network communication characteristics of the event from the acceptable network communication characteristics for the identified communication protocol; and c) generating a record of each deviation identifying a communication characteristic for which the deviation is detected, and identifying a computer security threat for the communication based on the records generated for the set of security events.

CROSS-REFERENCE TO RELATED APPLICATION

The present application is a National Phase entry of PCT Application No. PCT/EP2017/055082, filed Mar. 3, 2017, which claims priority from EP Patent Application No. 16162902.7, filed Mar. 30, 2016 each of which is hereby fully incorporated herein by reference.

TECHNICAL FIELD

The present disclosure relates to the identification of network traffic threats.

BACKGROUND

Computer networks can be employed by many technologies that threaten the security, reliability, accessibility and/or availability of computing resources. Such threats can include attacks exploiting vulnerabilities in computer systems, software or networks or attacks seeking to consume or overwhelm the resources of a computing system. Threats also occur where no direct attack is involved, such as the communication or propagation of malicious software (malware) or communication between executing malware in any number of computer systems.

Software, hardware and services deployed to identify and respond to such threats can include intrusion detection systems, virus detection systems, firewalls, anti-malware, anti-spyware and the like. Often multiple such services are deployed together to improve security and a challenge arises of coordinating between the services. For example, each deployed service can generate events corresponding to occurrences detected in a network. Such events can be disparate in any number of characteristics including, inter alia: form; format; style; content; nomenclature; frequency; sensitivity; etc. Such differences can arise due to different technologies employed by services, different threats or exploits analyzed by services, or even different vendors providing such services.

To partially address these challenges security experts can adopt rules-based threat monitoring technology in which rules are defined for application to events generated by security services to identify general classes of threat for further analysis. One such threat monitoring technology is BT Assure Threat Monitoring as described in “Threat Intelligence Visibility—the way forward” (Mike Adler, BT, 2015, available from www.globalservices.bt.com/uk/en/products/assure_threat_monitoring). BT Assure Threat Monitoring identifies events of interest using defined rules and draws together resulting sets of events into “problem tickets” for further processing or analysis.

While effective in identifying threats or potential threats, the disparities between events processed by rules can lead to the identification of multiple potential threats that actually relate to a single threat. Further, the rules-based approach requires frequent revision and addition to a repository of rules to avoid new or previously unidentified threats to be detected and relationships between them to be identified.

SUMMARY

Thus there is a need to address the aforementioned challenges.

The present disclosure accordingly provides, in a first aspect, a computer implemented method to identify a computer security threat based on communication via a computer network, the method comprising: receiving a definition of acceptable network communication characteristics for each of a plurality of communication protocols; receiving a set of security events for the communication, each security event including network communication characteristics for the communication; for each security event in the set of security events: a) identifying a communication protocol associated with the event; b) detecting deviations of network communication characteristics of the event from the acceptable network communication characteristics for the identified communication protocol; and c) generating a record of each deviation identifying a communication characteristic for which the deviation is detected, and identifying a computer security threat for the communication based on the records generated for the set of security events.

In some embodiments the method further comprises: receiving a definition of one or more computer security threats, each computer security threat being defined, for each of one or more communication protocols, by a set of deviations from the acceptable network communication characteristic for the protocol, and wherein identifying a computer security threat for the communication includes comparing the records generated for the set of security events to the received definition of one or more computer security threats.

In some embodiments the definition of acceptable network communication characteristics for a communication protocol is based on a specification of the protocol.

In some embodiments one or more computer security threats is further defined by an extent or range of extents of deviation from acceptable network communication characteristics for a communication protocol.

In some embodiments one or more of the records of each deviation further identifies an extent of deviation between network communication characteristics of the security event from acceptable network communication characteristics for the identified communication protocol.

In some embodiments network communication characteristics include one or more of: a size, range of sizes, volume, range of volumes, rate and/or range or rates of data in the communication; a size or range of sizes of a header of a message in the communication; characteristics of a source entity to the communication; characteristics of a destination entity to the communication; and/or features of a communication protocol.

In some embodiments features of a communication protocol include one or more of: an order of messages or types of message according to one or more protocol definitions; and a message structure or format.

In some embodiments characteristics of the source entity include one or more of: a particular port of the source entity; a set of ports of the source entity; and a number of ports of the source entity.

In some embodiments characteristics of the destination entity include one or more of: a particular port of the destination entity; a set of ports of the destination entity; and a number of ports of the destination entity.

In some embodiments the security events are received from one or more computer security services.

The present disclosure accordingly provides, in a second aspect, a computer system including a processor and memory storing computer program code for performing the method described above.

The present disclosure accordingly provides, in a third aspect, a computer program element comprising computer program code to, when loaded into a computer system and executed thereon, cause the computer to perform the method set out above.

BRIEF DESCRIPTION OF THE DRAWINGS

Embodiments of the present disclosure will now be described, by way of example only, with reference to the accompanying drawings, in which:

FIG. 1 is a block diagram of a computer system suitable for the operation of embodiments of the present disclosure.

FIG. 2 is a component diagram of a system to identify a computer security threat based on communication via a computer network in accordance with an embodiment of the present disclosure.

FIG. 3 is a flowchart of a method to identify a computer security threat in accordance with an embodiment of the present disclosure.

FIG. 4 is a component diagram of a system to identify a computer security threat based on communication via a computer network in accordance with an alternative embodiment of the present disclosure

FIG. 5 is a flowchart of a method to identify a computer security threat in accordance with an alternative embodiment of the present disclosure.

DETAILED DESCRIPTION

FIG. 1 is a block diagram of a computer system suitable for the operation of embodiments of the present disclosure. A central processor unit (CPU) 102 is communicatively connected to a storage 104 and an input/output (I/O) interface 106 via a data bus 108. The storage 104 can be any read/write storage device such as a random access memory (RAM) or a non-volatile storage device. An example of a non-volatile storage device includes a disk or tape storage device. The I/O interface 106 is an interface to devices for the input or output of data, or for both input and output of data. Examples of I/O devices connectable to I/O interface 106 include a keyboard, a mouse, a display (such as a monitor) and a network connection.

FIG. 2 is a component diagram of a system to identify a computer security threat based on communication via a computer network in accordance with an embodiment of the present disclosure. A computer network communication 200 is one or more network communications occurring on a computer network at a point or over a period in time. Such networks can include wired or wireless networks, physical or virtual networks, internal or public networks and the like. The communication 200 includes network messages, packets, datagrams, data transfer units, or the like communicated between a source entity and a destination entity. Such entities can be computer systems, hardware, software or firmware devices howsoever implemented and exemplars of their nature and implementation will be apparent to those skilled in the art and are beyond the scope of this description. The communication 200 employs one or more communication protocols which can be low level protocols such as link layer protocols (e.g. Ethernet), network layer protocols such as internet protocol (IP), transport level protocols such as transmission control protocol (TCP) and user datagram protocol (UDP), application layer protocols such as: domain name system protocols (DNS); media protocols such as video, audio, telephony, etc.; hypertext transfer protocol (HTTP); and specific protocols employed by applications software such as database applications, middleware software, Java software, message brokers and the like. It will be appreciated that many other communication protocols may be employed that are not mentioned here to which embodiments of the present disclosure can be applied.

For proper implementation by all interacting computing entities, a communication protocols is fully specified in a protocol specification. Such specifications can be public (such as the “request for comments” (RFC) specifications for internet protocols) or private and serve to define the characteristics of a network communication employing a protocol. For example, a protocol specification can define network communication characteristics such as: a size, range of sizes, volume, range of volumes, rate and/or range of rates of data in a communication using the protocol; a size or range of sizes of a header of a message in the communication; characteristics of a source and/or destination entities in the communication such as addressing methodologies, port usage and the like; and/or features of the communication protocol. Such features can include: a nature of connection setup for a protocol defining, for example, a protocol of message exchanges between communicating entities serving to define the commencement of a new communication, or continuation of an existing communication; a nature of ongoing communication techniques including error detection and recovery, data integrity checking, data communication retries, communication synchronization methodologies, message numbering conventions (if any), expected order of messages, expected message lengths, volume of messages and the like; message structure and/or format such as a header and payload organization and the constituents of each along with any checksum, resend or error detection mechanisms; communication close-down processes including a protocol for ceasing, terminating or closing a communication; and other such characteristics and features as will be apparent to those skilled in the art.

Communication 200 via a computer network can include communication attributable to malicious activity, malicious software, malicious intentions or communication of malicious content, payload or software or the like. For example, communication can be to transfer malicious software from one entity (source) to another (destination). Alternatively, or additionally, communication can be between one or more malicious entities and other entities, whether malicious or not. Yet further, communication can be targeted to exploit a vulnerability at an entity such as a computer system or the like, whether such entity is directly involved in the present communication or not. Many other such malicious communications can take place via the computer network as part of the communication 200 as will be apparent to those skilled in the art. All such malicious communications, along with other communications that seek to, have the potential to, may result in or otherwise cause unauthorized access, unauthorized use or unauthorized modification of a computing entity or resource shall hereinafter be referred to as a security threat.

Security services 204 a to m are provided with access to the communication 200 such as by being installed, configured or deployed at an endpoint (source or destination) of the communication 200 or at an intermediary of the communication 200 such as a switch, router, proxy, hub or the like. Alternatively, security services 204 can be provided as one or more additional entities in communication with the network on which the communication 200 takes place. For example, the security services 204 can include one or more of: an intrusion detection system; a malware detection system; a firewall; a proxy; an antivirus system; and a spyware detection system. Other suitable and/or desirable security services for network communication will be apparent to those skilled in the art. The security services 204 are adapted to generate security events such as may be prompted, occasioned or triggered by the communication 200. Thus the security services 204 monitor, watch, react-to, scan, observe or process data transmission as part of the communication 200 and, in accordance with the design and implementation of each security service 204, generate security events. For example, events of an intrusion detection service operating with transmission control protocol (TCP) communications and user datagram protocol (UDP) communications can include: “SYN Port Scan” (a technique employed by malicious software to determine the state of a communications port without establishing a full connection); “ICMP Flood” (Internet Control Message Protocol is a connectionless protocol used for IP operations, diagnostics, and errors. The sending of an abnormally large number of ICMP packets in an effort to overwhelm a target entity and reduce or deny service); “TCP Invalid Packet Size” 0; “ICMP Sweep” (also known as a “ping sweep” can be used to determine which of a range of IP addresses map to live computing entities); “TCP Port Scan” (a technique employed to identify open ports for TCP connections); “TCP Bad Option Length” (a non-compliant communications message where a specified option length does not match a specified standard for the protocol); and other security events. Security events are data items, data structures, messages, log entries or the like that indicate at least the nature of the identified security event (such as by way of an event identifier) and include network communication characteristics for the communication 200 for which the event is generated, such characteristics including at least characteristics relevant to, pertaining to, or supporting the generation of the event. For example, a security event can include: an event name or identifier; a time; a source network address for the communication 200; a source port (if relevant) for the communication 200; a destination network address for the communication 200; a destination port (if relevant) for the communication 200; and other characteristics as may support or relate to the event in question. Some events further include an identification of a communication protocol employed by the communication 200. Where an identification of the communication protocol is not included such identification can nonetheless be made with reference to the communication 200 itself or other events raised with respect to the communication 200.

Events generated by the security services 204 are combined to a security event set 222. In one embodiment the events are combined based on a rules-based security approach in which rules are defined to identify and record security events meeting certain criteria, such as, inter alia: particular combinations of particular types of event; particular volume of events; particular rate of generation of events; or combinations of or alternatives to such rules. In one embodiment the set of security events is generated as a problem ticket using a product or service such as BT Assure Threat Monitoring as previously described. Alternatively, events can be combined to an event set 222 based on predefined criteria such as, inter alia: a time window in which events occurred; the communication 200 or part of the communication 200 to which the events relate; a set or subset of the security services 204 from which the events arise; and other such suitable criteria as will be apparent to those skilled in the art.

Once generated the security event set 222 is processed by an event set processor 208 with reference to a definition of acceptable network communication characteristics 206 as described below.

The definition of acceptable network communication characteristics (ANCC) 206 includes a representation of network communication characteristics for a plurality of communication protocols that conform to a protocol specification such that communications exhibiting characteristics consistent with the ANCC 206 are considered consistent with the protocol specification. That is, where a protocol specification stipulates one or more particular characteristics for a communication protocol, those characteristics are defined in the ANCC 206. For example, if a protocol specification for a particular protocol stipulates a message header having a size ranging from 16 to 64 bytes, such range of header sizes is codified in the ANCC 206 for the protocol. Similar definitions can be included in the ANCC for conceivably any and all of the network communication characteristics described above, including inter alia: a size, range of sizes, volume, range of volumes, rate and/or range of rates of data in a communication using the protocol; a size or range of sizes of a header of a message in the communication;

characteristics of a source and/or destination entities in the communication such as addressing methodologies, port usage and the like; and/or features of the communication protocol such as: a nature of connection setup for a protocol; message structure and/or format, organization and the like; communication close-down processes including a protocol for ceasing, terminating or closing a communication; and other such characteristics and features as will be apparent to those skilled in the art.

A purpose of the ANCC 206 is to identify deviations from acceptable communication characteristics. Thus, in a simple example, where communication 200 includes a message having a header that falls outside a header size range stipulated in ANCC 206 such message and the communication can be determined to deviate from the ANCC 206. Such deviation will initially be flagged by way of a security event in the security event set 222 based on an identification by a security service 204. Such event includes network communication characteristics as previously described and this can include a header size for the communication 200. Such characteristics are compared with the ANCC 206 to identify deviation from the ANCC 206. Furthermore, in some embodiments, an extent of deviation is also determined. An extent of deviation is a degree, magnitude, amount, scope, intensity or other measure of deviation to quantify how deviant the communication 200 is with respect to the ANCC 206. Such extent can be, for example, a measure of a difference in size between a message header in the communication 200 compared to acceptable message headers according to the ANCC 206. Other measures of extent of deviation will be apparent to those skilled in the art.

The event set processor 208 is a hardware, software, firmware or combination component adapted to receive the ANCC 206 and process the security event set 222 to generate a set of deviation records 216. Each record in the set of deviation records 216 corresponds to a deviation of the communication 200 with respect to the ANCC 206 and, in some embodiments, an extent of such deviation. Thus the event set processor 208 includes a communications protocol identifier 210 for identifying a protocol for the communication 200 for which the events in the event set 222 are generated. This identification can be made with reference to one or more of the events in the event set 222 such as events explicitly referencing, indicating or identifying the protocol. The event set processor 208 further includes a deviation detector 212 and deviation recorder 214 for processing each event in turn. For each event the deviation detector 212 initially compares the network communication characteristics associated with the event with the ANCC 206 to detect a deviation. Where a deviation is detected, the deviation recorder 214 records the deviation to the deviation records 216 store. Each deviation record in the store 216 identifies the communication characteristic for which the deviation is detected. Notably, the protocol identification by identifier 208 can take place for each event in the event set 222. Alternatively, a single identification can be undertaken for all events in the set 222.

Thus the deviation records 216 identify deviations of the communication from the ANCC 206 based on security events generated by the security services 204. The deviation records 216 constitute a suitable basis for characterizing classes or categories of computer security threat since the nature and, in some embodiments, extent of deviation across network communication characteristics serve to suitably characterize a class of threat. Accordingly, types of threats can be classified and identified based on the deviation records 216. A threat identifier 218 is a hardware, software, firmware or combination component for accessing the deviation records 216 to identify a computer security threat based thereon. The threat identifier 218 can be informed of a set of one or more deviations from the ANCC 206 indicating, characterizing or otherwise identifying a computer security threat. Where such set of deviations is identified in the deviation records 218 the threat identifier 218 identifies the communication as a computer security threat.

In one embodiment a particular configuration of deviations from the ANCC 206 can be predetermined and/or predefined as one or more security threat definitions 220. Security threat definitions 220 include a definition for each of one or more computer security threats for each of one or more communication protocols. Each computer security threat is defined by a set of deviations from the ANCC 206 for a protocol. For example, security threat definitions 220 can be defined based on known security threats engaged in, involved in, or implicated in network communication for which deviations from the ANCC 206 are measured and recorded. Such deviations can be measured in a control environment for communication known to involve the security threat, or alternatively, learned or recalled from previously observer security threats. Thus, in use, the threat identifier 218 compares the deviation records 216 for a communication with security threat definitions 220 to identify a security threat. In some embodiments the identification of a subset of deviations defined for a security threat in the security threat definitions 220 is sufficient to warrant identification of a security threat, where the proportion or number of deviations for a threat that are identified in deviation records meets some predetermined threshold, or alternatively where some quorum of deviations for a security threat definition is identified.

FIG. 3 is a flowchart of a method to identify a computer security threat in accordance with an embodiment of the present disclosure. Initially at 302 the event set processor 208 receives the ANCC 206. At 304 the event set processor 208 receives the security event set 222. At 306 a loop is initiated through each event in the security event set. At 308 a communication protocol for a current event is identified by the communication protocol identifier 208. At 310 the deviation detector 212 detects deviations of network communication characteristics of the current event from the ANCC 206 for the identified protocol. At 312 the deviation recorder 214 generates a record to the deviations records 216 for each deviation. The loop continues at 314 until all events are processed. Subsequently, at 316, the threat identifier 218 identifies one or more threats based on the deviation records 216 as hereinbefore described, such as by reference to the security threat definitions 220.

One particularly advantageous application of embodiments of the present disclosure is in the identification of sets of security events that relate to common computer security threats. Typically, security analysts are required to analyze multiple prospective threats based on multiple sets of events where the sets actually relate to the same threat. Embodiments of the present disclosure can be configured to mitigate this challenge as described below.

FIG. 4 is a component diagram of a system to identify a computer security threat based on communication via a computer network in accordance with an alternative embodiment of the present disclosure. The arrangement of FIG. 4 shares many features in common with FIG. 2 and these will not be repeated here. FIG. 4 illustrates a plurality of security event sets 422 a to n each being processed by the event set processor 208 to generate a set of deviation records 416 a to n. An initial set of deviation records 416 a is alone suitable for identifying subsequent security threats by comparison of a subsequent set of deviation records (such as records 416 b) for commonality with initial set of records 416 a. Thus it can be seen that the security threat definitions 220 can be defined this way based on deviation records 416 a for, for example, known or suspected threats.

Additionally, whether deviation records 416 a are known to relate to a computer security threat or not, any subsequent set of deviation records such as records 416 b or n can be compared with records 416 a to identify commonality or identity to avoid duplicate comparisons by threat identifier 218. Thus the deviation records 416 a to n are processed by a security threat merger/deduplicator 428 (hereinafter the “merger component”). The merger component 428 is a hardware, software, firmware or combination component adapted to identify commonality between multiple set of deviation records 416 a and to provide a deduplicated set of deviation records 430 such that redundant sets of deviation records are avoided. This can be achieved by comparing deviation records such as records 416 a and 416 b and identifying identity. Where identity is identified, one of the sets of deviation records can be discarded. Where there is not complete identity, multiple options arise:

-   -   where one set of deviation records is a subset of the other, the         superset is preferably retained; or     -   where two sets of deviation records share a common subset, the         common subset is used to constitute a new set of deviation         records and the two original sets can be retained separately or         a hierarchy of deviation records can be formed in which the two         original sets depend or inherit from the new set of deviation         records with the common deviation records excluded.

Thus embodiments of the present disclosure further provide for consolidation of deviation records 416 and, therefore, consolidation of event sets 422 to aid threat identification either by human analyst of the threat identifier 218.

FIG. 5 is a flowchart of a method to identify a computer security threat in accordance with an alternative embodiment of the present disclosure. Many of the tasks of FIG. 5 are identical to those described above with respect to FIG. 3 and these will not be repeated here. At 526 of FIG. 5 the method stores a set of records of deviation 416 a as a security threat identifier for identifying subsequent security threats by comparing with the set of records. For example, the set of records of deviation 416 a can be stored to the security threat definitions 220.

Insofar as embodiments of the disclosure described are implementable, at least in part, using a software-controlled programmable processing device, such as a microprocessor, digital signal processor or other processing device, data processing apparatus or system, it will be appreciated that a computer program for configuring a programmable device, apparatus or system to implement the foregoing described methods is envisaged as an aspect of the present disclosure. The computer program may be embodied as source code or undergo compilation for implementation on a processing device, apparatus or system or may be embodied as object code, for example.

Suitably, the computer program is stored on a carrier medium in machine or device readable form, for example in solid-state memory, magnetic memory such as disk or tape, optically or magneto-optically readable memory such as compact disk or digital versatile disk etc., and the processing device utilizes the program or a part thereof to configure it for operation. The computer program may be supplied from a remote source embodied in a communications medium such as an electronic signal, radio frequency carrier wave or optical carrier wave. Such carrier media are also envisaged as aspects of the present disclosure.

It will be understood by those skilled in the art that, although the present disclosure has been described in relation to the above described example embodiments, the disclosure is not limited thereto and that there are many possible variations and modifications which fall within the scope of the claims.

The scope of the present disclosure includes any novel features or combination of features disclosed herein. The applicant hereby gives notice that new claims may be formulated to such features or combination of features during prosecution of this application or of any such further applications derived therefrom. In particular, with reference to the appended claims, features from dependent claims may be combined with those of the independent claims and features from respective independent claims may be combined in any appropriate manner and not merely in the specific combinations enumerated in the claims. 

1. A computer implemented method to identify a computer security threat based on communication via a computer network, the method comprising: receiving a definition of acceptable network communication characteristics for each of a plurality of communication protocols; receiving a set of security events for the communication, each security event including network communication characteristics for the communication; for each security event in the set of security events: identifying a communication protocol associated with the security event, detecting deviations of network communication characteristics of the security event from the acceptable network communication characteristics for the identified communication protocol, and generating a record of each deviation identifying a communication characteristic for which the deviation is detected; and identifying a computer security threat for the communication based on the records generated for the set of security events.
 2. The method of claim h further comprising: receiving a definition of one or more computer security threats, each computer security threat being defined, for each of one or more communication protocols, by a set of deviations from the acceptable network communication characteristic for the communication protocol, wherein identifying a computer security threat for the communication includes comparing the records generated for the set of security events to the received definition of one or more computer security threats.
 3. The method of claim 1 wherein the definition of acceptable network communication characteristics for a communication protocol is based on a specification of the communication protocol.
 4. The method of claim wherein one or more computer security threats is further defined by an extent or range of extents of deviation from acceptable network communication characteristics for the communication protocol.
 5. The method of claim 1 wherein one or more of the records of each deviation further identifies an extent of deviation between network communication characteristics of the security event from acceptable network communication characteristics for the identified communication protocol.
 6. The method of claim 1 wherein network communication characteristics include one or more of: a size, a range of sizes, a volume, a range of volumes, a rate or a range or rates of data in the communication; a size or a range of sizes of a header of a message in the communication; characteristics of a source entity to the communication; characteristics of a destination entity to the communication; or features of a communication protocol.
 7. The method of claim 6, wherein features of a communication protocol include one or more of: an order of messages or types of message according to one or more protocol definitions; or a message structure or format.
 8. The method of claim 6, wherein characteristics of the source entity include one or more of: a particular port of the source entity; a set of ports of the source entity; or a number of ports of the source entity.
 9. The method of claim 6, wherein characteristics of the destination entity include one or more of: a particular port of the destination entity; a set of ports of the destination entity; or a number of ports of the destination entity.
 10. The method of claim 1 wherein the security events are received from one or more computer security services.
 11. The method of claim 10, wherein one or more of the computer security services include: an intrusion detection system; a malware detection system; a firewall; a proxy; an antivirus system; or a spyware detection system.
 12. A computer system comprising: a processor and memory storing computer program code for identifying a computer security threat based on communication via a computer network, the processor and memory configured to: receive a definition of acceptable network communication characteristics for each of a plurality of communication protocols; receive a set of security events for the communication, each security event including network communication characteristics for the communication; for each security event in the set of security events: identify a communication protocol associated with the security event, detect deviations of network communication characteristics of the security event from the acceptable network communication characteristics for the identified communication protocol, and generate a record of each deviation identifying a communication characteristic for which the deviation is detected; and identify a computer security threat for the communication based on the records generated for the set of security events.
 13. A non-transitory computer-readable storage medium storing a computer program element comprising computer program code to, when loaded into a computer system and executed thereon, cause the computer to perform the method as claimed in claim
 1. 